Generating Possible Interpretations for Statistics from Linked Open Data
Abstract
Statistics are very present in our daily lives. Every day, new statistics are published, showing the perceived quality of living in different cities, the corruption index of different countries, and so on. Interpreting those statistics, on the other hand, is a difficult task. Often, statistics collect only very few attributes, and it is difficult to come up with hypotheses that explain, e.g., why the perceived quality of living in one city is higher than in another. In this paper, we introduce Explain-a-LOD, an approach which uses data from Linked Open Data for generating hypotheses that explain statistics. We show an implemented prototype and compare different approaches for generating hypotheses by analyzing the perceived quality of those hypotheses in a user study.
Similar resources
Analyzing Statistics with Background Knowledge from Linked Open Data
Background knowledge from Linked Open Data sources, such as DBpedia, Eurostat, and GADM, can be used to create both interpretations and advanced visualizations of statistical data. In this paper, we discuss methods of linking statistical data to Linked Open Data sources and the use of the Explain-a-LOD toolkit. The paper further shows exemplary findings and visualizations created by combining t...
Profiling Linked (Open) Data
The number of datasets published as Linked (Open) Data is constantly increasing, with roughly 1000 datasets as of April 2014. Despite this number of published datasets, their usage is still not exploited, as they lack comprehensive and up-to-date metadata. The metadata hold significant information not only to understand the data at hand but they also provide useful information to the cleansing an...
Application of Open Data for Official Statistics, Case Study Data of Instagram Social Network
The notion of open data is based on the idea that users should have free access to data, so that they can reuse it on their own and republish the results free from restrictions such as copyright and patents. Due to the ever-increasing trend of Information and Communication Technology (ICT), more data is produced every day, and this brings a great opportunity for National Statistical Offices (NSOs) ...
Recurrence Relations for Moment Generating Functions of Generalized Order Statistics Based on Doubly Truncated Class of Distributions
In this paper, we derived recurrence relations for joint moment generating functions of nonadjacent generalized order statistics (GOS) of random samples drawn from doubly truncated class of continuous distributions. Recurrence relations for joint moments of nonadjacent GOS (ordinary order statistics (OOS) and k-upper records (k-RVs) as special cases) are obtained. Single and product moment gene...
Fast Generation of Deviates for Order Statistics by an Exact Method
We propose an exact method for generating random deviates from continuous order statistics. This versatile method, which generates Beta deviates as an intermediate step, can be applied to any density function without resorting to numerical inversion. We also conduct an exhaustive investigation to document the merits of our method in generating deviates from any Beta distribution.